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Abstract 

We introduce a novel notion of probability within quantum history 
theories and give a Gleasonesque proof for these assignments. This in- 
volves introducing a tentative novel axiom of probability. We also discuss 
how we are to interpret these generalised probabilities as partially ordered 
notions of preference and we introduce a tentative generalised notion of 
Shannon entropy. A Bayesian approach to probability theory is adopted 
throughout, thus the ajdoms we use will be minimal criteria of rationality 
rather than ad hoc mathematical axioms. 
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1 Introduction and Summary 

In ^ we postulated a novel notion of probability by generalising Cox's axioms 
of probability E] in a manner appropriate to quantum theory. In this paper 
we wish to go one step further; we will present a uniqueness proof analogous 
to Gleason's theorem for our postulated generalised probabilities. We will be 
helped considerably by another analogue of Gleason's theorem in the literature 
0] which is applied to the decoherence functional in the History Projection 
Operator (HPO) formulation of the consistent histories programme [H]- First we 
will review results previously discussed ^ and then we will outline the relevant 
Gleason-like theorem, its interpretation and, for completeness, its proof. We 
will then propose a generalised entropy. 

We will adopt a Bayesian approach to probability theory and we will use 
Cox's approach in particular 3:- Probabilities are usually considered real num- 
bers because of an association with relative frequencies. As soon as we adopt 
an approach to probability theory where we merely assign probabilities as an 
ordered notion of preference then there is absolutely no a priori reason to con- 
sider probabilities as real numbers. One might try to design 'zeroth' axioms of 
probability theory which end up ensuring that probabilities are in fact real num- 
bers, and then one might introduce further axioms to constrain how we assign 
these real numbers to propositions. Such an approach is, however, problematic 
because such 'zeroth' axioms are rather ad hoc. 



Consider some arbitrary propositions a, [3 and 7 to which we are to assign 
probabihties. Consider also a notion of ordering '>' to be defined on the proba- 
bihty space. Two possible 'zeroth' axioms [S], which constrain how this ordering 
notion is to behave, are 'universal transitivity': 

Axiom Oa: If p{a\I) > _p(/3|/) and p{f3\I) > p(7|/) 

thenp(a|/) >p(7|/), (1) 

and 'universal comparability': 

Axiom Ob: For all a, (3 we have that cither p{a\I) > p{(3\I) 

or p{a\I) < p{(3\I) or p{a\I) = p{f3\I). (2) 

Given these zeroth axioms it would seem natural (although it is still not 
strictly necessary) to use real numbers for probability assignments. Axiom Oa is 
often considered desirable because probabilities are intended to represent tran- 
sitive notions of preference in some sense. Axiom Ob is, however, far more 
dubious and there is a history in the literature of people trying not to assume it 
(see Appendix A. 3 of for references and also see 7 and jSj). Why presume 
that we can probabilistically compare all propositions universally, especially in 
quantum theory where some propositions are considered 'incompatible' or 'com- 
plimentary' ? It is prudent to not assume axiom Ob from the outset (it might 
be that we are forced to adopt it later). If we were to adopt axiom Ob then it 
might be that we will be introducing relationships between probabilities that we 
are not justified in invoking — and any problems like nonadditivity and so forth 
might be due to such a mistaken assumption. 

Rather than adopt these two controversial zeroth axioms let us, for the time 
being, use a weaker zeroth axiom that we can all surely agree on: 

Axiom 0' : If a < /? then, presumably, p{a\I) < p{(3\I), (3) 

where '<' is, in the least, a partial order in the context of both the proposition 
space and the space of generalised probabilities^. We call this axiom 'monoticity' 
[21 . So as to avoid confusion with standard probability theory (theories which 
obey axioms Oa and Ob) it is prudent to call any assignment which merely obeys 
the weaker zeroth axiom (O by another name: we will call them 'pedagogical 
examples' or 'pegs'. Probabilities are then special examples of pegs. 

Our task in this paper is then to find a peg theory for a histories algebra. 
We will use the histories propositional algebra 'P(V), where V = ®^TL, which 
was originally introduced by Isham 0. The natural connectives on this space 
of projection operators are the standard A, V, -< connectives and we use the 
standard partial order < upon it A homogeneous history proposition a is 
defined as a time ordered tensor product of projection operators at^ G T'{7i): 

a := at„{tn) ® at^_,{tn-i) ® ... (8 atiih). (4) 

^ In the context of standard probability theory we could use Venn diagrams to define a < fS 
using subset inclusion and we would define p{a\I) < p(l3\I) using the total ordering of real 
numbers. 
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We stay in the Heisenberg picture such that each projection operator has the 
dynamics already imphcit such that at^{tm) = U^(tm—tm-i)cttmU{tm—tm-i) 
where df^ are Schrodinger picture operators. Our novel peg assignments, those 
that we suggested in P, are 

pia\I) -.^trniC^p) (5) 

where Cq. — cit„ (t„)cit„_j (i„_i)...dtj (ii) and p is a density operator on Tt. We 
keep an explicit hypothesis / in the notation because such a thing is natural 
when discussing probabilities from a Bayesian perspective and it avoids 
us confusing peg assignments that are made given different prior information. 
Clearly these pegs might obey (|2Jl by relating the natural partial order on P{V) 
to some partial order on (D. Once we introduce further axioms that these pegs 
ought to obey — other than ^ — then we will be able to speculate what this 
partial order on the peg space might be. 

We proposed these pegs for the history algebra V{V) because they are addi- 
tive for disjoint homogeneous history propositions ^ — thus these complex pegs 
seem to behave something like we expect probabilities should. Now we aim to 
show that we can derive these pegs from axioms analogous to Cox's axioms of 
probability theory applied to the HPO algebra using an analogue of Gleason's 
theorem. 

2 A Gleason Analogue 

So, let us remind ourselves of what Gleason's theorem 11 tells us. Gleason's 
theorem is about trying to assign probabilities to a quantum propositional alge- 
bra. In standard quantum theory the relevant propositional algebra is taken to 
be ■P(W), the set of projection operators upon a Hilbert space Ti, where the nat- 
ural logical connectives on V{TC) are A, V, and we denote the standard partial 
order relation '<'. So, naturally, a probability assignment should obey certain 
rules which we shall use to define what is often, perhaps confusingly, called a 
state (we call it a state more because of what we end up proving). A state cr G 5 
is a real valued function on V(Ti.) which has the following properties: 

1. Positivity: a{P) > for all P E V{H), 

2. Additivity: if P and R are disjoint— P A ^ = 6— then a{P V R) = a{P) + 

3. Normalisation: (t(1) — 1, 

where G V{TL) is the proposition that is always false, and 1 G V{T() is the 
proposition that is always true. Gleason's theorem is simply that states assigned 
to ViTi), for AubH > 2, are in one-to-one correspondence with density matrices 
on Tt such that 

CTp(P) = tr(Pp) for aU P eV{n). (6) 

One takes the propositional algebra of projection operators, makes basic as- 
sumptions about how probabilities ought to behave, and one derives that such 
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probability assignments are in one-to-one correspondence with density matri- 
ces. The axioms of probabihty theory ensure the density matrix structure of 
quantum theory. 

In analogy with Gleason's theorem we should list a set of axioms that our 
pegs should obey and then try and derive the theorem from there. For these 
axioms, we argue, we should look to Cox's axioms of probability theory which 
ensure that we don't introduce functional relationships between peg assignments 
that aren't rationally justified. Cox's A-axiom is simply that the pegs we peg 
to propositions conjoined using the 'and' operation should be limited to be 
functionally dependent only on some very specific pegs: 

p{a^m■■^F[p{a\m.p{m] (7) 

where F is an arbitrary function that is sufficiently well-behaved for our pur- 
poses. Similarly, Cox's -i-axiom is that the peg we peg to the negation of a 
proposition should only functionally depend upon the peg of the proposition 
before it was negated: 

p{^a\I) := G[p{a\I)]. (8) 

These two axioms are criteria of rationality that are at the heart of Cox's ap- 
proach to probability theory and these are all he needed (except for the ad- 
ditional assumption that probabilities are real numbers) in order to prove the 
basics of probability theory as applied to a Boolean algebra of propositions. 

Cox's two axioms suggest we should use a peg that is additive for disjoint 
history propositions and that is normalised It turns out that this will not 
be sufficient for our peg theory; we will need a further axiom. Luckily one is 
forthcoming. Note that in the HPO formulation of history theories we have the 
three natural logical connectives A, V, which correspond roughly to 'and', 'or' 
and 'negation' operations (although with the standard non-distributivity issue 
we have in quantum theory). Clearly, however, when going from projection 
operators defined at a single-time to explicitly history orientated propositions 
there is another natural connective, namely changing the temporal order. As 
such we can define an operator M which reverses the order on any tensor product 
vector, M{vi ®V2® •■•«»«) := {vm ® Vm-i ® ■■■Vi). Thus the temporal reversal 
of the Heisenberg picture history proposition a is given by: 

<a MaM ^ at^{ti) ® at^{t2) ® ... at^{tn). (9) 

Note that < reverses both the kinematical and the dynamical temporal or- 
derings because we are in the Heisenberg picture (c/. |12|). Applying < twice in 
this manner obviously gives back the same history proposition; behaviour that 
is analogous to the 'negation' operation. Hence we should introduce a third peg 
axiom analogous to ((HJ: 

p{Ml) H[p[a\I)] (10) 

where H is an arbitrary function that is sufficiently well-behaved for our pur- 
poses. The peg we assign to such a proposition should be assigned in a non- 
contextual manner according to (|10|l . Clearly, by Youssef's argument [J], we 
could use complex numbers and still keep consistency with analogues of Cox's 
two axioms. Our tentative third axiom seems to give our pegs an 'extra degree 
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of freedom'. Using real numbers would not be a possibility for a peg that also 
obeys (|10|) and luckily we are using the uncontroversial weaker zeroth axiom 
Q. Hence it might be that we can find a peg which obeys Q, lO, © and 

In analogy with how we define states in Gleason's theorem let us define 
complex assignments as maps / : 'P(V) <C which obey the following rules: 

1. Conjugation. 1(a)* — l{<\a) for all a, 

2. Additivity. If a and (3 are disjoint then l{a V f3) = l{a) + l{(3), 

3. Normalisation. Z(l) = 1. 

We use the similar notation as we used for the Hilbert space Ti. because V is 
still a Hilbert space (although we leave the 'hats' off operators in V). This is an 
advantage of using Isham's HPO formulation of the history algebra [Sj- 

Note that the peg axioms (0), © and ifTIH) do not uniquely ensure that we 
must use the complex assignments I — ^just as Cox's axioms in standard probabil- 
ity theory don't ensure that we must use real numbers per se. The peg axioms 
ensure that, whatever assignments we do use for convenience, such assignments 
at least obey the relevant criteria of rationality. Hence we do not argue that 
the complex assignments / are uniquely the only pegs we could use, but clearly 
the maps I do obey {Tj), (jSJ) and H1U|) and might yet obey ||3J) for some partial 
order on (D. One might even argue that it is not the particular assignments 
that matter; it is the catalogue of functional relationships between them that 
are important (these we categorise axiomatically using analogues of Cox's ax- 
ioms). Nonetheless, it is convenient to use particular representations (just as it 
is convenient to use real numbers in standard Bayesian probability theory). 

Can we now start to tackle a proof of a Gleason-like theorem for these 
complex assignments 17 In fact, the result follows from an analogue of Gleason's 
theorem for decoherence functionals already in the literature |4j. 

Let us first review some identities 0]. Note that 

trniAiA2...An) = tr^,^H{Al ® ia «> -AnS) (11) 

where Am are arbitrary self adjoint operators on Ti. and S' is a linear operator 
S : — > 0""^ defined on product vectors by 

S{vi (X) U2 <8) ... ® v„) := W2 ^ W3 ® ... ® w„ (X) wi (12) 

and extended by linearity and continuity to give a unitary operator on ®"7i. 
We can swap between the Heisenberg and Schrodinger pictures using: 

trniCa) = tr^^niatr, "X) at„_i <E) ...at^S^). (13) 
where the ctt^ are Schrodinger picture projection operators and ;= (tjl (8> 

...^Ul)SiUi®...^Un). 

In an analogous way 0], we can absorb the initial state into an operator 
defined on V = '^"H using the identity fTT|l . Note that 
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tr7^(iii2...i„) 

= tr^-w((il ® ^ «) --in-l ® ln)(ll ® I2 --In-l (8)iri)S') (14) 

= tr55„«((ii ® ia «) ...i„-i ® l„)r) (15) 
= ti(s^-in{{Ai(E)A2(E)...An-iY') (16) 

where Y' is obtained from Y by tracing over a complete set of states for the nth 
Hilbert space. The form of Hll(l is preserved under removing an operator by the 
action of a partial tracing, and is also preserved when removing the dynamics 
from around each single-time proposition in the history proposition. 

So, using these identities we can absorb all the dynamics and initial state 
into some operator Z such that: 

trw(C'ap) =tr^^:«(at„ (8 at„„i... at^^p,//). (17) 

Note that the LHS of (|17|l is in the Heisenberg picture whereas the RHS is 
in the Schrodinger picture — we have split the dynamics and kinematics into 
distinct entities. There are good reasons for doing this as it would allow us to 
investigate the distinction between the two forms of temporal orderings |12j . 
But note that we can stay within the Heisenberg picture if we wish (for it is, by 
far, the preferable picture |13p: we keep the dynamics around the corresponding 
projection operators and absorb just the initial state into an operator Y: 

trn{CaP) = tr®,i-H(at„(t„) (X) at„_i(t„_i) (g) ...dti(ti)Fp). (18) 

The above ensures that we can put our tentative assignment into 'Gleason' 
form. Now we can prove an analogue of Gleason's theorem for such operators 
Yp. The theorem and proof follows the analysis in 4 almost word for word. 
There are, however, distinguishing features and, for completeness, we repeat the 
analysis here since the proof is so short. 

Theorem 

If dim V > 2, the complex assignments / are in one-to-one correspondence with 
operators y on V = 0"7i according to the rule 

1(a) = trv(ar) (19) 

with the restrictions that: 

a) ^ MYM (20) 

b) trv(r) = 1 (21) 

where M is an operator that reverses the order of the entries in a tensor product 

vector; M{vi » W2 <8) ■■■Vm) ■= [Vm ® Wm-l ® ...Vi). 
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Proof 



In one direction, the theorem is trivial; if a function I is defined by the right 
hand side of 119|l it clearly obeys the crucial additivity condition. The extra 
requirements H2()|l and H21|l ensure normalisation and conjugation requirements. 

Conversely, let I : V^V) — > (D be a complex assignment. The proof that it 
must have the form (|19|) exploits Gleason's theorem applied to 'P(V). 

Let Re I and Im I denote the real and imaginary parts of I so that 

l{a) = Re l{a) + Im l{a) (22) 

where Re l{a) € R and Im I (a) G R. The additivity condition on I means 
that Re l{a) and Im l{a) are additive functions on 7^(V), i.e., Re l{ai V 02) = 
Re l{ai) + Re l{a2) for any disjoint pair ai, a2 of projectors and similarly for 
Im I. We have that l{a) is a continuous function of its argument and hence 
a t—^ l{a) is a continuous function on V{V), as are its real and imaginary parts. 
However the set of all projectors in the finite dimensional space V is a finite 
disjoint union of Grassman manifolds and is hence compact. It follows that the 
functions a i— > Re l{a) and a ^ Im l{a) are bounded below and above. On the 
other hand, for any r G R, the quantity 

Krict) ■— rdim(Q:) — rtr(a) (23) 

is a real additive function of a, and hence so are Re I + Kr and Im / + for any 
r, s G R. We can choose an r such that Re I + Kr > for all a (and s such that 
Im I + Kg > 0) and due to an upper bound we can choose positive real scale 
factors /X and z/ such that for all a we have that 



< fiCRc I + Kr){a) < 1 (24) 
< iy{Im I + Ks){a) < I. (25) 

These inequalities plus the additivity property show that, for each a G 'P(V), 
the quantities a /^(Re I + Kr){a) and a t-^ i/(Im I + Ks)(q;) are states on the 
lattice V{V). Then Gleason's theorem shows that there exists a pair of density 
matrices and on V such that for all a G P{V), 

/^(Re / + Kr)(a) = trv(p^a) (26) 

i/(Im ; + Ks)(a) = trv(p^a) (27) 

and so 

Re l{a) = trv((-p^ - r)a) = trv(r^a) (28) 

Im l{a) = trv((i/9^ - s)a) = trv{Y^a) (29) 

where :— j^p^ — r and := -i-p^ — s. Thus we have shown the existence of 
a family of operators Y := + iY^ on V such that 

lp{a)=tiv{aYp) (30) 
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This completes the proof of the theorem because the conditions (|20|) and 1)21(1 
follow at once from the conjugation and normalisation conditions on complex 
assignments. □ 

We do not discuss any extensions to infinite dimensional cases. We add the 
subscript ptoYto emphasise that it depends upon the initial state; it is antici- 
pated that Yp can be decomposed into some operators on V which are universally 
defined (through relations between traces of products of operators and traces 
of tensor product operators) and some operators that are related to the initial 
state. Clearly the Y operators on V and density operators on H are intimately 
related, the task is now to investigate the properties and interpretation of these 
pegs. But, in the least, we can put our assignment |(SJ| into the form 13U|) for 
which we have an analogue of Gleason's theorem. 

One issue that we have to identify is that we have promoted the operation 
< to a connective on par with V, A, and it may not seem natural to some to 
do this. We considered it natural because we were going from a space ViH) 
which identified propositions at a single time to a space 'P(V) which explicitly 
identified history propositions. So we need a connective that can relate different 
temporal orderings. One might then query, why specifically <? Why not some 
other operation, like making any permutation of single-time entries? Staying 
in the Heisenberg picture ensures that the dynamics are already taken care 
of, and permuting the entries would mess up this fact — hence we only discuss 
a connective that maintains the dynamical relationships between single-time 
propositions. 

Note that we do not need to use the monoticity axiom © for Gleason-like 
proofs; it is, in some sense, redundant. Nonetheless, how might our pegs obey 
monoticity? In P(V) we have the following condition: 



For our pegs we have that p{0\I) = and = 1 and hence, by monoticity, 

we must, in the least, demand that 



One tentative partial order might look something like Fig.lQ. This partial or- 
der^ has the added advantage as we are unable to relate p{<ja\I) and p{a\I) 
using it (the partial ordering is symmetric in the real axis and complex conju- 
gation represents time reversal). Since, by the partial order on V{V), a NR <ia 
(where NR stands for 'not related to by the relevant partial order') we ought to 
demand this of the peg space as well, such that p{<Ja\I) NR p{a\I). Also, this 
partial order reduces to the standard probabilities when we move to the real 
line between and 1. 

Note, however, that there are many partial orders on (D and there might be 
another applicable one — we introduce the partial order represented by Fig.(E3) 

^ Equivalently, one can picture the same partial order as allowing many different paths from 
uncertainty to certainty such that these paths are symmetric in the real axis. These paths 
would look like the lines of magnetic flux between North (0) and South (1) magnetic poles in 
2D (not illustrated). This absolves us of Jaynes' argument against comparative probability 
theories — ones that don't obey axiom Ob — by allowing many dense paths from to 1, see the 
appendix of |6)). 



< a < 1 for aU a G P{V). 



(31) 



< p{a\I) < 1 for aU a E P{V). 



(32) 
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Figure 1: Contour lines of pegs of equivalent size by a suggested 
partial order on (D. The 'height' of each contour gets larger starting 
from the circle around until we reach the circle around 1. 



to emphasise that monoticity will provide constraints for the partial orders that 
we can use. For disjoint propositions a and /3 we have that a < hence the 
monoticity requirement ensures that p{a\I) < 1 — p{(3\I) which is satisfied as 
long as (|32|l is. The constraint (|32|l thus ensures a class of partial orders that 
might be useful and Fig.(^ seems the most apt. 

Even obeying all the peg axioms we have to hand it is still very difficult 
to interpret these pegs as 'probabilities' per se because we are taught again 
and again that probabilities are real numbers. However, remember that there 
is no a priori reason to regard probabilities as real numbers, they are merely 
magnitudes that we use in order to assign a partially ordered notion of preference 
to propositions. 

3 Discussion 

Having an analogue of Gleason's theorem for our pegs is not enough; we now 
need to argue why we should use such complex pegs in the first place. We have 
given an argument based on rejecting axiom Ob so now it is important to discuss 
the factitious problems that are solved by rejecting it. If we keep axiom Ob then, 
by the very fact that we will be comparing propositions in a manner that is not 
justified rationally, we will be introducing relations within the probability space 
that are not underpinned by relations in the proposition space. Hence we call 
such relations in the probability space 'factitious'. 

The language we have used here is perhaps quite telling. Our use of the 
term 'factitious' harks back to Einstein's use of the term. Clearly, if we adopt a 
relational approach to theory building (one which obeys Leibniz's principles of 
relationalism jl4| ) we do not want to introduce factitious elements in any theory. 
Cox adopted a rationalist approach analogous to relationalism — he gave a for- 
mulation of probability theory which ensures that factitious functional relations 
between propositions, quantified using pegs, are never introduced. Instead we 
only maintain those functional relationships that we can justify. Clearly this 
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peg approach might therefore be useful in the quantum gravity domain where 
an exphcitly relational approach is often considered a requirement. One might 
naively argue that, in addition to a relational notion of spacetime, one needs a 
relational notion of probability. In the least it would be prudent to adopt an 
approach to probability theory based upon criteria of rationality rather than ad 
hoc axioms. 

Hence it is plausible that problems like nonlocality are factitious problems 
that are caused by invoking a probability theory which does not have sufficient 
structure. Thus a Bayesian approach may bring locality back to quantum theory, 
just as Einstein bought locality back to gravitational theories in building general 
relativity jl5| — in fact this is the major reason many invoke Bayesian reasoning 
within quantum theory jJS] (although one does not need to adopt as drastic a 
peg theory as we adopt here in order to tentatively deny nonlocality). 

Similarly, the problem of hidden variables might also be factitious. The 
Kochcn-S pecker theorem [T^ seems JHl to prove that we cannot assign defini- 
tive values to variables prior to measurement in quantum theory. This is ob- 
viously compatible with a Bayesian approach to ignorance — we cannot assign 
such values because we are explicitly presuming we are ignorant of them. 

These complex pegs are intimately related to the approaches of Feynman 
and Hartle 'SU' who invoke real 'probabilities' that can lie outside of [0, 1]. 
Hartle's virtual 'probabilities' are explicitly found using the real parts of our 
complex pegs (jSJ which, in turn, were originally introduced by Goldstein and 
Page |21| in their linearly positive histories approach. Thus the linearly positive 
histories and the consistent histories programmes appear naturally within this 
peg framework. If we wish to discuss real Bayesian probabilities we could follow 
the linearly positive histories programme and take the real parts of our pegs and 
ensure a linearly positive condition (22j . Similarly, if we wish to discuss relative 
frequencies we could follow the consistent histories programme and take the real 
parts of our complex pegs and define a consistency condition stronger than linear 
positivity |23| . Even so, we are still wary of invoking complex pegs because they 
are so alien to our usual notion of relative frequency. Note, however, that we are 
not wholly uncomfortable as such complex pegs appear naturally in quantum 
theory — the generalised Berry phase is derived from such complex pegs ,24) and 
is an experimentally verifiable quantity (using ensembles of experiments). As 
such, we can combine such phases and frequentist notions into one probabilistic 
entity using axioms that were outlined over 60 years ago by Cox 2 . 

In the history of science we have been rather ambiguous about what the word 
'probability' means. Some call relative frequencies 'probabilities' even though 
they don't behave in the same manner as the term in common parlance. Simi- 
larly we have called certain non-additive numbers in quantum theory 'probabil- 
ities' even when they do not obey axioms of probability — nor axioms of relative 
frequency (which are necessarily additive). We might call our complex pegs 
'probabilities' because at least they do obey rational probabilistic axioms, but 
perhaps we would make a similar category error or confuse the issue by doing so; 
hence, for the want of a better name, we have resorted to calling them 'pegs' — at 
least it begins with a 'p' so we don't need to change our notation. So far we 
have two different kinds of pegs but there may be more. We have objects that 
obey Cox's two axioms which are real; we might call these 'round pegs'. We also 
have these objects that obey analogues of Cox's two axioms, and our tentative 
third, which are complex. Let us call these 'square pegs'. These names lead 
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naturally to a playful, albeit unfortunately sardonic, metaphor for what we are 
trying to do. A baby metaphor for science perhaps. We are trying to find the 
right-shaped peg for the corresponding hole and we must reject the pegs that 
do not fit snugly. When dealing with a histories algebra we argue that these 
complex pegs fit rather snugly. 

4 Entropy 

With a generalisation of probability to hand, we must also begin to discuss a 
generalisation of entropy — a complimentary concept that is often just as impor- 
tant as a good notion of probability. Perhaps we don't have to search very far 
for such a generalisation. 

First of all, how should a notion of entropy behave? It should behave, in 
part, like a probability. It should probably be a transitive or monotonic notion of 
preference in some sense 123- It should reflect the space of pegs in a natural way. 
Hence the first naive object to suggest is simply a generalisation of Shannon's 
entropy: 

n 

S[P{UI)] --KsY, P(« V) V) (33) 

1=1 

where P{la\I) := {p{ai\I) : i — l,2...Na} and Ks is a constant. Does this ob- 
ject S[-] behave like an entropy should? Does it, for example, obey the grouping 
property |2fij : a property that Shannon suggests is natural for any notion of 
entropy Consider the complete — disjoint and exhaustive — set {a*} split 

up into groups labeled by an integer g. We could consider the peg-entropy H33|l 
of the original set as split up into the peg-entropy of each of the groups and the 
peg-entropy as to which group g — 1,2,..., Nq one should use; this alternative 
way of looking at the peg-entropy 1)33(1 of the set should be equivalent to not 
splitting {a*} into groups (this is the grouping property). How we split the en- 
tropy into groups should not make a difference. We can split up the peg-entropy 
as follows: 

Ng 

S[P{UI)] :- -if5EEp("V)lnp(aV)- (34) 
g ieg 

The complex peg we assign to a group g is simply 

p{g\I) = Y.p{a^\I). (35) 
So now we must ask ourselves whether 

S[PiUl)] - S[P{1g\I)] + J2pig\I)Sg (36) 

9 

where 

Ng 

S[P{1g\I)] := -KsJ2pi9\I) lnp(.g|/) (37) 

9 
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and 



Sg := -i^s5]p(al5/)lnp(«%/). (38) 

ieg 

In order to work this out we need to work out what pegs we should assign 
to the histories {a* : i d g} upon the knowledge that the group g is the correct 
group. Thus we need a notion of conditioning. We need to work out what 
conditional pegs we should use and whether it allows (|33() to obey the grouping 
property. 

Using Bayes' rule in our complex peg framework is quite interesting: 

P(a \gl) = ^-TTT • (39) 

P{9\I) 

It is natural to assign p{g\a^I) = 1 if we know that a' is in the group g (we 
are normalising due to ((SJ c/. 3 ). Hence we should assign 

Pia^\9l)-'-^ (40) 

to conditional grouping pegs. Does this allow the grouping property to be 
satisfied? Note that for y, z G (D we do not necessarily have that In - = Iny — Inz 
because of different branches of the logarithm function; it only works if — tt < 
(arg(a;) — arg(?/)) < tt. Complex logarithms behave as follows: 

ln(re*^) =lnr + i6l (41) 

where we may choose the principle value of 9. Renaming the index i with j so 
as not to confuse it with imaginary components, it is therefore clear that: 

y In ^ y (In + - ^9,) (42) 

where 9j = arg[p(a-' |/)] and 9g = a,rg[p{g\I)]. 

So, using the definition of the complex logarithm, we have that 



-J2pia^\I)\npia^\I) -p(5|/) ^ In ^^-p(5|/) lnp(g|/). (43) 

Thus we do have the grouping property for our test entropy functional (|33|l i.e. 
we have that 

S[P{UI)] = S[P{1g\I)] + J2p{g\I)Sg + 2mni (44) 

9 

where m is an integer — H33|l is satisfied as long as we identify the different 
branches of the logarithm. 

What, other than the grouping property, should an entropy functional obey 
so as to be a useful definition of uncertainty or information? According to 
Shannon S[-] should be continuous in the pegs. When all the pegs are 
equal (and hence realpi = — ) then it should be a monotonic increasing function 
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of n. Hence S[-] should correspond to the standard Shannon entropy for the 
real subset of complex pegs. Clearly (|33|l is very plausible as a generalisation 
of Shannon entropy that is apt for quantum histories. But, like the Shannon 
entropy for real probabilities, is it possible to prove that we must use H33|l 
because it is the only functional that fits the required desiderata up to some 
equivalence of functionals? This we can't yet answer. 

Are strong additivity and concavity also satisfied by this peg-entropy? In 
order to find out we need to define a conditional peg-entropy; presumably this 
involves Bayes' rule which is satisfied by our complex pegs since the A-operation 
is associative Lets define the conditional peg-entropy in an analogous way 
to how we define conditional Shannon entropies: 

S[P{UlpI)] ^p(/3^|/)5[P(l„|/3^/)] (45) 

i 

= -ifs^p(/3^|/)5]p(a'|/3^/)lnp(a^l/3^/). (46) 

3 i 

And thus we can check whether the following 'strong additivity' condition is 
satisfied by S[-]: 



S[P{UMp\I)] = S[P{U\I)]+S[P{lp\l^I)] 

= S[P{UI)]+S[P{U1^I)]. (47) 

We can also check whether the following 'concavity' condition is satisfied: 

S[P{U\I)]>S[P{la,\lpI)]. (48) 

Note that P(l„ A 1/}|/) := {p {a' A (3^1) : « = 1,2, ...n^ and j = 1,2, ...np}. So 
we can work out the LHS of H47I) to be 

S[P{U A 1^|/)] = -Xs^5I^("' ^'^'1-^)- (49) 

3 

We can also work out the RHS (second decomposition) of l|T7|l : 

3 3 i 

Now, p(a* A (3^ \I) = p{a'^\j3H)p{(3^ \I) because Cox's axioms ensure that this 
is the case. Note that X]i-P("^N/3"'-^) = 1 f^'' each j as long as a' all commute 
with (3^ . Hence we can identify the LHS and RHS and thus (|47|l is satisfied for 
sets of commuting histories. Clearly it is natural that strong additivity applies 
for commuting histories because, in such cases, we can easily interpret the two 
sets of history propositions to be compatible. If they do not commute then 
there is no a priori reason we should demand strong additivity, just as there is 
no a priori reason we should demand comparability (by the dubious axiom Ob) 
of probabilities in such cases. 

In order to work out when H48|l is also satisfied by our novel notion of entropy 
we would have to decide what partial order on the space (D we ought to use. 
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Monoticity provides significant constraints upon what partial orders we can use 
and, as we argued above, it seems we should use the partial order illustrated in 
Fig-O fo'" psgs. The partial order on the peg space will inform the partial order 
on the peg-entropy space (although perhaps scaled by the Ks constant). Con- 
cavity might then be satisfied, at least for a subset of histories. Having shown 
that our tentative notion obeys the grouping property (albeit identifying 
branches of the logarithm), it is a matter only of mathematical consequence 
whether our peg-entropy also obeys other convenient identities like strong ad- 
ditivity and concavity (these identities are not axioms per se). It is clear, then, 
that (p!!^ is a plausible generalisation of entropy for quantum history theories 
but we have not yet proved whether all peg-entropies that obey the grouping 
property for complex pegs must be of this form. 

5 Conclusion 

In quantum theory we use Gleason's theorem to justify the probabilistic assign- 
ments we give to projection operators. However, as soon as we begin to discuss 
more than one single projection operator — when we begin to discuss history 
propositions — we have to postulate a notion of state collapse in order to de- 
fine probabilities. However, such postulated probabilities are non-additive and 
many problems or issues arise because of this. From a Bayesian perspective it 
is even dubious to call such things 'probabilities' because they are non-additive 
and thus alien to our normal notion of probability [2H1- Problems with nonlo- 
cality also arise by discussing propositions that involve two or more times (in 
a given frame of reference). Hence it is natural to tackle this problem head-on 
and define a propositional space that includes multi-time propositions. Since 
we do not want to give any causal bias to the peg or probability theory that we 
use |2n| it seems prudent to put timelike and spacelike separated propositions 
on the same footing [30]; hence we might naively like to use tensor products 
to produce history propositions (this is the HPO algebra)^. Rather than pos- 
tulate dubious notions of state collapse one can then derive a monotonic peg 
for such history propositions, and one can do such a thing without getting into 
the problems of non-additivity and, tentatively, nonlocality. There also exists a 
plausible generalisation of Shannon entropy for such pegs. Of course such com- 
plex pegs are alien to our standard notion of probability. However, our standard 
notion of probability is rather alien too; when you get down to it, what really 
does the term 'probability' mean? The interpretation of probabilities is clearly, 
historically speaking, a debatable issue and hence it is necessary to axiomatise 
and formalise a relational approach. Such an approach will ensure that, even if 
we don't know with full clarity what such concepts mean, we will, in the least, 
not introduce functional relationships between pegs that we are not justified in 
introducing. 

So we cannot yet give a clear answer to the question: What are probabilities? 
One can, however, begin to answer another quite daunting question: Why do 
we naturally find complex numbers in quantum theory? 

^Of course, in full generality, one would prefer to use a fully relational propositional algebra. 
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